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Abstract 

The Optimal Filtering (OF) reconstruction of the sampled signals from a particle 
detector such as a liquid ionization calorimeter relies on the knowledge of the nor- 
malized pulse shapes. This knowledge is always imprecise, since there are residual 
differences between the true ionization pulse shapes and the predicted ones, what- 
ever the method used to model or fit the particle-induced signals. The systematic 
error introduced by the residuals on the signal amplitude estimate is analyzed, as 
well as the effect on the quality factor provided by the OF reconstruction. An anal- 
ysis method to evaluate the residuals from a sample of signals is developed and 
tested with a simulation tool. The correction obtained is showed to preserve the 
original amplitude normalization, while restoring the expected x 2 ~hke behavior of 
the quality factor. 
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1 Introduction 



The signals arising from the ATLAS electromagnetic calorimeter (EMC) [T] 
are shaped by a bipolar filter, then sampled every 25 ns at the LHC bunch 
crossing frequency and stored in analog buffers. Upon a positive decision from 
the level- 1 trigger, a limited number of these samples (typically 5) are digitized 
and acquired. The amplitude and timing information of the shaped signals are 
determined combining the signal pulse samples with a digital filtering tech- 
nique commonly called Optimal Filtering (OF) [2]: this method is optimized 
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to minimize the noise contribution to the variance of the reconstructed signal 
amplitude, while guaranteeing that the latter is an unbiased estimator of the 
true amplitude. 

Alongside the amplitude and timing information, the OF reconstruction is 
designed to produce a quality factor that should allow the discrimination of 
pathological signals from regular ones. The normalized OF quality factor ob- 
tained from regular signals follows a standard x 2 distribution, while spurious 
signals generate large quality factor values: these signals could in principle 
identified and be rejected with a cut on this quantity. 

The computation of the Optimal Filtering Coefficients (OFC's) for a given 
readout cell requires the knowledge of the signal pulse shape and of the (ther- 
mal and pileup) noise time autocorrelation [2] . While the latter can be directly 
measured from dedicated noise calibration runs and minimum bias events, 
several different approaches have been proposed to predict the ATLAS EMC 
ionization pulse shapes and their relative amplitudes with respect to the cal- 
ibration signals used to probe the detector readout properties [3,4,5,6J. The 
precision of these pulse prediction methods is quoted in terms of the difference 
between the predicted ionization signal and the observed one, the two pulses 
being normalized to the same amplitude. The vector of differences computed 
for each digitized sample is commonly called the residuals vector (defined in 
Equation [[]): an accurate prediction method is usually quoted to lead to a 
difference < 1% at the sample closest to the signal peak, and always between 
± 2% for the neighboring samples PflfSfo] . 

Assuming that such a precision is achieved for the readout cells of the AT- 
LAS EMC using a given pulse prediction scheme, this work aims to study 
how the unavoidable presence of the residuals systematically affects the signal 
amplitude reconstruction and its noise variance (Section Hj), as well as the rel- 
ative quality factor distribution, thus impairing the discriminating power of 
the latter (Section EJ). 

A technique to optimize the quality factor without spoiling the initial recon- 
structed amplitude normalization is developed and tested on a reference cell 
for different possible distributions of the amplitude, which is proportional to 
the deposited energy. The same technique proves to be a powerful tool to ex- 
tract from data the ionization pulse shape (up to a normalization factor) when 
no previous knowledge - even approximate - of it is available (Section EJ). 

This technique has been developed in the framework of the ATLAS EMC, but 
it holds for any other detector readout system that exploits the OF reconstruc- 
tion of multiple-sampled signals (e.g. the ATLAS Tile hadronic calorimeter 

0)- 
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2 Notation and nomenclature 



In the following a matricial notation will be used in all calculations, A^ samp i es 
being the number of signal samples digitized during the data acquisition, thus 
the typical size of all vectors and matrices: 

A matrix of size Samples x A^ samp i es 

Aij (i, j)-th entry of matrix A 

a vector of size iV sam pics 

a, i-th entry of vector a 

A -1 inverse matrix 

A T transposed matrix 

a T transposed vector 

a scalar 

The following nomenclature is used: 

h vector of samples of normalized observed ionization signal 

g vector of samples of normalized predicted ionization signal 

g' vector of samples of normalized predicted ionization signal derivative 

a vector of amplitude OFC's computed from g (and g', if time constraint 
is included [2]) 

b vector of time OFC's computed from g and g', if time constraint is 
included 

r vector of pulse residuals: 



s vector of observed ionization signal samples for a given pulse amplitude 



n vector of noise contributions to signal samples, having the properties: 



r = h — g 




.4: 



s = Ah + n 




(n) = 
(nn T ) = C 



(3) 
(4) 
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C noise covariance matrix: 



C = C T 



(5) 
(6) 




R weight matrix, built from the noise autocorrelation function: 




i 




-i 



(7) 



I identity matrix 



3 Numerical examples 

All the equations derived in this work are illustrated using a simulation tool 
that can generate pulses s for a given signal h, noise autocorrelation R and 
width a n , and a chosen distribution of amplitudes A. The tool computes OFC's 
a (and b) from a given pulse prediction g and noise autocorrelation Rofc ( n °t 
necessarily equal to the signal noise autocorrelation), and applies them to 
the generated samples to obtain the corresponding distribution of amplitude 
estimates A (and time estimates r), and the relative quality factors (defined 
below in Section I5TT1) . 

The test signals h and g (and their residuals r) used in the simulations are 
plotted in Figured! they correspond to ionization pulse predictions used during 
the ATLAS EMC Barrel commissioning operations in 2007, namely to the 
ones corresponding to the middle compartment cells located_|at [r] ce n, ce n] = 
[20,50] (h) and [r/ ce ii, C eii] = [20,51] (g). They have been explicitly chosen 
to be very similar, in order to mimic an optimal situation in which the pulse 
prediction largely satisfies the precision criteria mentioned in Section [IJ 

The numerical examples correspond to the fixed pulse phase illustrated in 
Figure [H that corresponds to the typical EMC data taking condition at LHC, 
when iV sam pi es = 5 and the signals are digitized so that the third sample is 

1 The position of ATLAS EMC readout cells is specified by using indexes corre- 
sponding to the local granularity in pseudorapidity r] and azimuthal angle <j) [IJ. In 
the case of the middle compartment this is Ar] x A(p = 0.025 x 0.025, implying: 



7] = 0.025 • (77ceii + 0.5) 

= 0.025- (0 ccU + 0.5) 



(8) 
(9) 
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located near the pulse maximum ± 2 ns. The actual values used in the simu- 
lations are tabulated in Table [TJ 

Figure [2] shows the values of the thermal noise autocorrelation function used 
to generate the noise affecting the pulses s. This is carried out by build- 
ing the matrix R as a Toeplitz matrix based on the relevant autocorrelation 
function, which in this case was measured in high gain from middle cell at 
[?7ceii> 4>ceii] — [20, 50] during the ATLAS EMC Barrel commissioning operations 
in 2007. The matrix Rofc ; used for computing the OFC's, is based on the mea- 
sured autocorrelation function from a neighboring cell at [^ ce ii, ce ii] = [20, 51]. 
The two autocorrelation functions are quite similar, and previous studies have 
shown that the residuals obtained are insensitive to details of the autocor- 
relation function. We thus use Rofc and R interchangeably. A noise width 
a n = 5 ADC counts is used, corresponding to a typical value for a EMC Barrel 
middle compartment in high gain. 

Figure [3] shows the distributions of the amplitudes A generated for the nu- 
merical examples. These amplitudes span the range between and 3000 ADC 
counts, covering a realistic range of the EMC electronics after the ADC pedestal 
subtraction. In order to illustrate the effects related to the distribution of the 
energy deposited in a EMC cell, we present results from a flat distribution, a 
exponential-like and a Gaussian-like distribution of possible amplitudes A. 



4 Amplitude Estimate Error and Noise Variance Bias 

The OF estimate of the signal amplitudes A given the digitized samples s is 
A = a T s (10) 

that can be expanded according to Equation (|2J) as: 

A = a T (Ah + n) = Aa T h + Af (11) 



where M = a T n is the (reduced) noise contribution to OF amplitude estimate 
A, that has variance: 



cri 




•jsf = a n V a T Ra ~ a n J —— (12) 



where the last approximation holds for small noise autocorrelation and neg- 
ligible time jitter, and is exactly true if the noise autocorrelation is null and 
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the OFC's are not optimized for time jitter. 



If the ionization pulse prediction g is not perfect (g 7^ h, Equation (IT])), the 
OF estimate of the pulse amplitude suffers of a systematic error e = a T r: 



A = AaL T g+AaFr+J\f e (13) 
A = A(l + e)+Af e (14) 

This error is independent of the pulse amplitude and depends only on the shape 
and amplitude of the residuals r. This property suggests a simple correction: 
for a given set of OFC's a scale factor (e.g. obtained by in-situ calibration 
with particles of known mass) would restore the proper scale. In our numerical 
example e ~ -0.00158: the distributions in Figure H] confirm that the systematic 
bias is independent of the amplitude size and distribution. The average values 
of the distributions of the normalized amplitude differences are the same 
(within statistical errors) for all the three different forms of the amplitude 
distribution. The widths of the distributions in Figure H] differ because of 
the varying relative impact of the the noise on the pulse amplitude, since: 



— = e+ A (15) 



The larger the amount of small signals in the sample under study, the larger 
the tails in the distribution. 

The use of imperfect OFC's also affects the noise contribution to the OF 
reconstructed amplitude. The size of this bias can be approximatively esti- 
mated using Equation (fl2l) . and recalling that a = yjj^p in the approximation 
of negligible noise autocorrelation and when no time constraint is imposed: 



~N \ Hsll 2 \ 



1 + 



2g T r + 


r 


2 




|g|| 


2 



~v / TT2i~l + e (16) 

The bias to the minimized variance is of similar magnitude as that found for 
the signal amplitude. 
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5 Quality Factor Analysis 



5. 1 Definition 



A x 2 -like quantity Q is usually computed from the signal samples s and the 
amplitude estimate A, in order to evaluate the quality of the OF signal recon- 
struction. For a signal of given amplitude A a vector x of differences is built 
as: 

x = s-ig (17) 

and the quality factor is computed as its squared norm: 

Q 2 = x T x (18) 



or, if the noise autocorrelation is taken into account, as: 

Q 2 = x T R x x (19) 



If the time constraint is imposed during the OFC's computation, a set of time 
OFC's b is obtained, and the time estimator r can be computed from the 
samples as [2]: 

T = ib T s (20) 



In this case, the vector x can be computed as: 

x = s-i(g + g'r) (21) 



5.2 Properties and Limitations 



The vector of differences x as defined in ( TT71) can be expanded as: 

x = s-ig (22) 
= Ah + n- a T (Ah + n) g 
= Ah + n — A& T (g + r)g — a T ng 
= Ah + n — Ag — Aa T rg — J\fg 

= n - Wg + A(r - eg) (23) 
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If the ionization pulse prediction is perfect (h = g) the difference vector x 
reduces to: 

x = n - A/g (24) 



and its components X{ are then normally distributed (following the statistical 
properties of n and J\f) around 0. The variable Q 2 n (normalized quality factor): 

Ql = - 2 Q 2 = -V^x (25) 



follows then a x 2 distribution with A7" sam pi es — 1 degrees of freedom (Figure [5]). 
The statistical properties of the normalized x 2 distribution can then be used to 
identify, and optionally reject, distorted signals. For instance in the numerical 
examples, where N samp \ es — 1 = 4, the probability that a signal affected only 
by noise gives a normalized quality factor larger than 25 is smaller than 10 -4 : 
cutting all signals with Q 2 n > 25 would guarantee a very high efficiency while 
rejecting a large fraction of spurious signals. 

On the other hand, if the pulse prediction is not perfect (h ^ g) the x compo- 
nents are no longer normally distributed, since they all contain a contribution 
related to the distribution of the pulse amplitude A (Equation (123}) ). In this 
case the Q 2 n variable defined in Equation (j25l) no longer follows a normalized 
X 2 distribution, long tails arise in the distributions (Figure |6]), and the shape 
of the normalized quality factor Q\ depends now on the original signal ampli- 
tude distribution. In this case the discriminating power of the quality factor 
is impaired. If in fact the normalized quality factor of strongly distorted sig- 
nals (e.g. in the rare case of important signal pileup from previous events) 
is so large (> 10 3 ) that even the use of imperfect OFC's would not really 
affect its rejection capability, there are other categories of spurious signals for 
which the distortion is smaller, that produce quality factor values comparable 
to the ones obtained using imperfect OFC's. This is for instance the case of 
slightly out-of-time signals (Figure [7]); of signals affected by noise bursts, when 
occasionally a large noise spike in the readout electronics coherently distorts 
the signal positively in one of the samples and negatively in the following one 
(Figured]); of large cross-talk from neighboring cells (Figure [H]), when a signal 
similar to the pulse derivative (capacitive cross-talk), and proportional to the 
energy release around the cell, distorts the pulse j8] . In these conditions, when 
using perfect OFC's the large quality factor values produced are of a similar 
magnitude as the ones obtained from regular signals using imperfect OFC's. 

Another problem concerns the variability of the quality factor distributions 
from cell to cell. The residuals may in fact differ from cell to cell due to 
differences in the properties of the readout channels, as well as the accuracy 
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of the pulse predictions and OFC calculations for each cell. The distribution 
of amplitudes also differs from cell to cell, due to the spatial variation of 
energy deposits, and of the different size and depth of the cells in the detector. 
Differences in the quality factor distributions complicate cross-comparisons of 
this quantity for different cells, or the construction of meaningful cumulative 
distributions. 



6 Residuals estimation from quality factor components 

6.1 Pseudo-residuals definition 



For a pulse of given amplitude A a vector y can be built as the ratio of the x 
components ([17]) and the reconstructed amplitude A (TTJJ: 

x n - Wg + A(t - eg) 

a = Mi+ljTJf (26) 



We call event pseudo-residuals the component of the y vector computed for a 
given pulse amplitude and noise contribution (i.e., for a given energy deposit 
and readout condition in the cell, thus for a given event). 

If the noise contribution could be ignored for each event (cr n = 0, n = 0, 
Af = 0), this y vector would assume a simplified form, independent of the 
pulse amplitude: 



* = (27) 

1 + e V ; 



We call average pseudo-residuals this ^ vector, and discuss below (Section l6.3l) 
how and under which conditions the components of ^ can be obtained from 
the distributions of the components of y. Note that the average pseudo- 
residuals are orthogonal to the amplitude OFC's, since: 



* = (28) 
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6.2 Estimation of residuals and calculation of OFC correction from average 
pseudo-residuals 



The system of linear equations which followd" 2 ! from Equation (127)) : 

(l + e)* = r-eg (30) 

* + *a T r = r - ga T r (31) 

(i - ga T - *a T ) r = * (32) 



H 

unfortunately cannot be solved to find r. In fact the matrix H is in general ill- 
conditioned, and in particular exactly singular, when no noise autocorrelation 
is present in the system. This property is directly related to the component 
ga T reducing the rank of H, since: 

det(ga T ) ~ (33) 



for general values of r, and exactly null when R = I. 

The impossibility to solve system (1321 is directly related to the (hopeless) 
attempt to extract information about r from e in Equation (1311) : in fact, given 
a scalar e and a vector a, there unfortunately exists a infinite set of vectors 
r that satisfy the relation a T r = eP*l This is an intrinsic limitation of the OF 
signal reconstruction that cannot be circumvented: by applying the OFC's a 
to the samples s when estimating the signal amplitude a part of information 
about the original pulse h is lost. 

On the other hand, from Equation ( 1271) it is possible to define a more general 
class of the vectors Y\ that satisfy the relation a r r^ = A: 

r A = (l + A)* + Ag AG 3? (34) 



When any of these Y\ vectors (134)) is used to correct the ionization pulse 
prediction g, the resulting corrected pulse g c is exactly proportional to the 

2 Equation (|32h exploits the relation: 

(a T b)c= (ca T ) b (29) 

aeSR MeS™xR" 

3 Consider for instance the vector: 
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ionization pulse h we were aiming for, apart from a scale factor: 

g* = g + r A = (1 + A)(g + *) = (35) 



Of course the preferred choice would be A = e, but the impossibility of this 
option has already been discussed. On the other hand, by choosing A = an 
interesting property of the correction is found: the OFC a c obtained from the 
corrected pulse g^ = ° preserve the reconstructed signal amplitude normalization 
obtained from the original a OFC's obtained from the initial pulse prediction 

g. If we let a h represent the OFC's corresponding to the true ionization pulse 

h, we have the following relationship between the corrected and the "true" 
OFC's: 

a c = (1 + e) a ft (36) 
and then (J\f h = a h n): 



A c = a c s (37) 
= (1 + e) (Aa h h + a ft n) (38) 
= (l + e)A + (l + e)M h (39) 

The a c OFC's correctly compensate for errors in the initial prediction of the 
ionization pulse shape, while preserving the amplitude normalization obtained 
from the original a OFC's. When trying to optimize the global cell calibration 
this certainly represents an advantage. In fact, in order to correct for the effects 
of material in front of the detector, or to compute the correct global energy 
scale, the reconstructed signal amplitudes are always multiplied by additional 
calibration factors obtained in-situ from reference data samples (e.g. electrons 
from and Z° decays). In the process of computing these factors with an 
increasing data sample, and optimizing at the same time the reconstruction of 
the signal amplitudes, the normalization of the latter is better kept preserved 
in order not to mix the effects. 

Whatever the choice of A (and thus also for A = 0) the use of the corrected 
OFC's restores the proper behavior of the quality factor. In fact the components 
of the difference vector x c : 



s - Ag c (40) 

Ah + n- ^±£a^ (Ah + n) ^±^h 
1+A h v J 1+e 

n-Af h (41) 
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are now correctly normally distributed around 0, and: 

^Ql = ^xfR" 1 ^ (42) 



is guaranteed to follow a x 2 distribution with iV samples — 1 degrees of freedom. 
6.3 From event pseudo-residuals to average pseudo-residuals 



The possibility of correcting the OFC's by exploiting Equation fl35|) is related 
to the capability of extracting consistent estimators of the average pseudo- 
residuals \I/ from the distribution of the event pseudo-residuals y as defined 
in Equation ( l26lh 



For a given value of the signal amplitude A every yi is distributed as the ratio 
of two Gaussian variables with different non-zero means, different variances, 
and a certain correlation. The y vector can in fact be expressed as: 

_ Ac + m _ c+> 
Y[A) - Ad + Af- d+\M (43) 



where: 

c = r — eg 

(44) 

d = 1 + e 

and 

m = n — Afg 



(45) 



The components of the m vector are normally distributed around with stan- 
dard deviation am, p(m,Af) being the vector of their correlation with the 
denominator noise factor M. 

The distribution of such a ratio variable is not necessarily Gaussian, especially 
in case of noisy measurements. On the other hand under certain assumptions, 
namely that the denominator of the ratio is very unlikely to become negative: 

Ad>3atf (46) 
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the probability distribution of this ratio is exactly knowcQ [9110] . and the 
distribution of each component of y(A) for a given value of A approaches a 
Gaussian G((yi(A)),a yi ( A )) with: 

(y(A)> ~ - d = * (47) 



Note that the standard deviation 0"y(A) scales as -j. The locaO] normality 
condition (|1B|) for y(-A) is amply satisfied by: 

i > 3a n (49) 



since a n > oyy because of the OF noise reduction. 

Our specific case is additionally complicated by the fact the amplitude A in 
(1261) follows its particular distribution P(A) that is not a priori known and that 
might change from cell to cell depending on the position in the detector and 
the event sample selection. The cumulative distributions D(yi\P(A)) of the 
event pseudo-residual components y i} given a probability distribution function 
P(A) for the signal amplitudes and holding the normality requirements (l4"9|) . 
are then: 

D{ Vi \P{A)) = J dAP{A)G{^a yi{A) ) (50) 



While the actual shapes of these D(yi\P(A)) distributions vary according to 
P(A) (and in most cases are definitively non-Gaussian!), it is always true that 
every D(yi\P(A)) is symmetric, and that the vector of their means corresponds 
to the average pseudo- residual vector whatever the initial P(A), since: 

(D( yi \P(A))) = J dy lVt J dAP{A)G^ h a yi{A) ) 
= J dAP(A) J dyiViG^a^A)) 

S v ' S v ' 

= X VA , Equation (1471) 

= % (51) 



4 Additionally, under the hypothesis (|46p a transformation exists to compute a 
derived variable that is exactly Gaussian. 

5 The local attribution refers here to the fixed value of the pulse amplitude A. 
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It is then possible to estimate the average pseudo-residuals \I/ as the means of 
the distributions of the event pseudo-residuals, after having applied the local 
normality requirements (j49j) on the events entering the distributions. 

The property expressed in Equation (1511) is clearly verified in our numerical ex- 
amples. Figure [TU1 shows the event pseudo-residual distributions corresponding 
to the three different functional forms for P(A), and built from events passing 
the tight threshold cut on the reconstructed amplitude A > 25 = 5a n . Their 
shape certainly depends on the initial distribution P(A) of the signal ampli- 
tudes A, but as expected they are always symmetric, and their means (here 
evaluated by means of a Gaussian fit to the distributions) correspond in all 
cases to the average pseudo-residual vector ^ as shown in Figure [TTJ the error 
bars accounting for the width of the corresponding distribution. It is evident 
that a distribution for the ratio corresponding to an amplitude distribu- 
tion P(A) with predominately small values of A (such as the exponential in 
this example) would have larger tails, as the prevailing contributors to D are 
Gaussian distributions of large standard deviation (see Equation f|48l) ). 



6.4 Average pseudo-residual estimators as OFC pulse correction 

The mean values defined in Equation (1ST]) can be used as estimator of ^ to 
correct the original OFC's via Equation (155]) . by choosing A = to follow 
the prescription to preserve the original reconstructed signal amplitudes, as 
discussed in Section |6j Because of the small size of this correction the variation 
of the pulse derivative g' can safely be neglected, and the original one can be 
used to computed OFC's optimized for time jitter. 

When new OFC's are computed from g c , g' and R, and reapplied to the 
original signal samples s, as expected from Equation (J55|) the reconstructed 
amplitudes are unchanged, preserving the original normalization (Figure [T2|) . 
The corrected quality factors follow now a % 2 -like distribution, for any initial 
pulse amplitude distribution (Figure [T3]) . 



6. 5 Pulse shape estimate with flat filter average pseudo-residuals 

The pseudo-residuals correction technique proves also to be a powerful tool 
to extract from data the ionization pulse shape (up to a normalization factor) 
when no previous knowledge of it is available, whatever the available signal 
sample. This technique addresses the relative normalization of the signal sam- 
ples corresponding to different energy deposits in the readout channel under 
study. 
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Let u be the normalized vector of size iV sam pi es with all components of equal 
magnitude (flat pulse): 

u = {l,l,...,l} (52) 



and compute the corresponding a u OFC's (flat filter). When they are used 
to reconstruct the signal amplitude and obtain the pseudo- residuals ty u , from 
Equation (|35|) one gets: 

u + * u = -^h (53) 
1 + e u 



that is exactly the ionization pulse h, up to the normalization factor j— 
(which, for the reason discussed above, is again bound to remain unknown). 
Figure [H] shows the distributions of the event pseudo-residuals (biased by 1) 
when the flat filter is used for the reconstruction of the signals belonging to the 
example distributions; the vectors of their averages nicely correspond to the 
original ionization pulse h (Figure fT5l) . In our example e u = -0.412544: if this 
value is used to scale the biased pseudo-residual averages in order to compare 
them with the true h, excellent agreement, ranging from ~ 10 -4 (exponential 
distribution of pulses dominated by small pulses, thus larger pseudo-residual 
distributions) to ~ 10~ 5 (flat distribution of pulses, narrower pseudo-residual 
distributions), is found. 



7 Summary and Conclusions 



The unavoidable presence of residuals in the pulse shape prediction used to 
compute OFC's is bound to introduce a systematic error in OF amplitude 
estimates. This systematic error is independent of the pulse amplitude and 
thus can be easily corrected, absorbing it in a simple scale factor. Its size 
is maintained small (and even negligible) by the usual requirements on the 
amplitude and shape of the pulse prediction residuals. On the other hand, 
even the smallest residuals will introduce a bias in the OF quality factor, 
impairing its discriminating power. 

The mathematical properties of the OF reconstruction do not allow the mea- 
surement of the residuals shape and amplitude: this loss of information is an 
intrinsic property of the OF reconstruction technique itself. It is possible to 
extract from any data sample an alternative vector of pseudo-residuals, that 
allows the correction of the original pulse shape prediction and obtain the 
exact ionization pulse shape, apart from an uncertainty in the scale factor 
that is bound to remain unknown. An ad hoc choice for the normalization 
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of the pseudo-residuals correction is shown to preserve the original normal- 
ization of the reconstructed amplitudes. Given any sample of signals from a 
readout cell it is always possible to obtain its exact pulse shape from an initial 
(even imprecise) prediction, while its absolute normalization must be obtained 
from different sources. The pseudo-residuals correction restores the expected 
X 2 -like behavior of the OF quality factor and its full discriminating power, 
independent of the initial bias. 

The OF reconstruction properties discussed here, and the possibility to obtain 
an optimal correction for any initial pulse shape prediction, suggest a possible 
calibration strategy for the ATLAS EMC at the LHC start-up: OFC's are 
computed exploiting the present best knowledge of the ionization pulse shapes 
and normalizations, and as soon as enough data is collected to compile reliable 
distributions of the event pseudo-residuals, this technique is used to optimize 
OFC's (thus quality factor distributions and noise reductions) without spoiling 
the initial normalization of the reconstructed amplitudes. 
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i hi gi ri Roi at bi 

0.04761 0.04873 -0.00111 1 0.14516 -7.2531 

1 0.65923 0.67365 -0.01442 0.07108 0.22650 -26.9503 

2 0.99769 0.99613 0.00156 -0.15330 0.38105 9.1090 

3 0.80987 0.80709 0.00279 -0.29747 0.33019 5.9919 

4 0.55451 0.55357 0.00095 -0.10336 0.35092 8.3074 

Table 1 

Numerical values used in the simulations. 
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Fig. 1. Signals h and g and their residuals r (magnified by a factor 10) used in the 
numerical examples. 
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Fig. 2. Noise autocorrelation i?o« = Rjo used in the numerical examples. 




Fig. 3. Distributions of the amplitudes A generated for the numerical examples. 
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Fig. 4. Distributions of the reconstructed amplitude relative errors, related to the 
presence of residuals r in the pulse used to compute the OFC's. 




Fig. 5. Quality factor distributions for the different signal amplitudes, as obtained 
when using perfect OFC's (i.e. computed from the exact ionization pulse h for null 
residuals r). 
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Fig. 6. Quality factor distributions for the different signal amplitudes, as obtained 
when using imperfect OFC's (i.e. computed from the predicted ionization pulse g 
for non null residuals r). 
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Fig. 7. Quality factor distributions obtained using perfect OFC's for signals out-of- 
time by ~ 1 ns. 
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Fig. 8. Quality factor distributions obtained using perfect OFC's for signals affected 
by noise bursts. The noise bursts are simulated by summing a noise spike - normally 
generated with a 4cr n mean and la n RMS - to one signal sample at random, and 
subtracting it from the following sample. 




Fig. 9. Quality factor distributions obtained using perfect OFC's for signals af- 
fected by large capacitive cross-talk from neighbor cells. The cross-talk distortion 
is simulated by summing to the current pulse with one having the shape of the 
signal derivative h , and amplitude proportional to 1.5% of a signal independently 
generated from the same amplitude distribution. 
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Fig. 11. Means of the event pseudo-residual component distributions for the different 
signal amplitude distribution P{A) and for A > 25 = 5cr„. Error bars indicate the 
widths of the distributions. 




Fig. 12. Distributions of reconstructed amplitude relative error, as obtained when 
correcting the OFC's input pulse g with the pseudo-residual estimators (y). 
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Fig. 13. Quality factor distributions for the different signal amplitudes, as obtained 
when correcting the OFC's input pulse g with the pseudo-residual estimators (y). 
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Fig. 14. Distributions of event pseudo-residual components biased by 1, as obtained 
for the different signal amplitudes for A > 25 = 5a n , when a flat filter is used for 
the amplitude reconstruction. 
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Fig. 15. Means of the distributions of the event pseudo-residual components biased 
by 1 for the different signal amplitude distribution P(A) and for A > 25 = 5a n , 
when a flat filter is used for the amplitude reconstruction. The error bars are smaller 
than the point markers. 
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